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Abstract. The unified property roughly states that searching for an 
element is fast when the current access is close to a recent access. Here, 
close refers to rank distance measured among all elements stored by the 
dictionary. We show that distance need not be measured this way: in 
fact, it is only necessary to consider a small working-set of elements to 
measure this rank distance. This results in a data structure with access 
time that is an improvement upon those offered by the unified property 
for many query sequences. 



1 Bounds for searching: notation and history 

Comparison-based searching is one of the most fundamental operations in com- 
puter science: given a set S oi n totally ordered items, create a data structure 
that, given a query key x, will return the largest key in S that is no larger than 
X. This is called predecessor search. We focus on the case where S is static, and 
thus can be assumed to be the integers from 1 to n. We refer to a search that 
returns x as an access to x. Let A = (ai, 02, . . . denote a sequence of accesses 
to be performed on a data structure, with m chosen to be sufficiently large to 
absorb any start-up costs. 

Any comparison-based search data structure is, at its core, a method of 
choosing which comparisons to perform in order to execute an access. The data 
structure is essentially a way of encoding a comparison tree to execute each 
access — the data structure could do this in an explicit way like in a binary search 
tree, or in an implicit way as in the binary search algorithm. It has been long- 
known that information theory tells us that the worst-case time for an access 
must be i7(logn), and that O(logn) can be achieved with data structures such 
as binary search on a sorted array. 

But, worst-case analysis is not the end of the story, as one can design data 
structures that execute operations in o(logn) time if the operations have some 
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kind of order to them. Thus, we can create data structures with access times that 
are functions of the access sequences themselves (or some distributional statis- 
tic of the access sequence) — these runtimes will still have O(logn) worst-case 
behavior, but will be faster over sequences that have certain desirable charac- 
teristics. We now review some runtime bounds that have been introduced, and 
the data structures whose runtimes have bounds (wc will say a data structure 
has a bound to mean that its runtime can be bounded by the bound): 

Static optimality bound. If the number of searches in A to a; is f{x), then the 
runtime to search for is o|^log j^-j^ Knuth showed how to achieve this 
bound if the /(•)-values are given in advance [T]. 

Working-set bound. Let Wi{x) be the number of distinct items accessed since 
the last access to a; in ai, . . .ai-i. A data structure has the working-set bound 
if an access to a,; takes time 0{\og'Wi{ai)). The idea behind this is that if the 
accesses are restricted to a subset of k items, then the accesses will take time 
O(logfc) rather than O(logn). It has been shown that the working-set bound 
implies the static optimality bound in the amortized sense. Splay trees [2] have 
the working-set bound in the amortized sense, while the working-set structure 
[3] was designed to have this bound in the worst-case. 

Queueish bound J^. The working-set bound requires that items which were 
accessed recently take less time than those that have not been accessed in a 
while. The queueish bound reverses this and states that the time to access ai 
should be C'(log(n — Wi(ai))); thus any structure with the queueish bound will 
execute the least recently accessed item in constant time. No dictionary is known 
to have the queueish bound and it remains open whether such a dictionary can 
exist; however, it was shown that there is a structure with a close-to-queueish 
bound of 0(logn — if,;(a,;) + loglogn) amortized access time. 

Dynamic finger bound |51 6f . Let d{x, y) be the number of keys between x and 
y va. S (this is just \x — y\ if 5* is the integers from 1 to n). The dynamic finger 
property says the cost to execute search is 0(logd(ai_i, a^)). Level-linked 
trees [7] have the dynamic finger property in the worst-case, and splay trees 
have the amortized dynamic finger property. 

Unified bound The dynamic finger bound and the working-set bounds are 
the best known bounds on the runtime of splay trees, yet neither implies the 
other and neither is tight. For example in the sequence (1, 2, . . . n, 1, 2, . . .) the 
dynamic finger will give a bound of 0(1) per operation on average, while the 
working-set bound gives a bound of O(logri) per operation. In the sequence 
(l,ri, 1,71, . . .), the situation is reversed. The unified bound was proposed as a 

* In this paper, log = log 1 = 1. 
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natural combination of these two bounds^ an access is fast if it is close in key 
value to something that has been recently accessed. Formally, a data structure 
has the unified bound if accessing takes time 0(logminj(d(ai, j) +'Wi{j))). 
This clearly implies the working-set bound (set j = ai) and the dynamic finger 
bound (set j ~ ai-i). A non-tree structure was presented with the unified bound, 
and it is conjectured that splay trees have the unified bound. A binary search 
tree (EST) structure with the unified bound plus an additive 0(loglogri,) is 
known [5], and a EST structure with the unified bound was claimed [TU], but 
later declared to be buggy [TT] . 

There are several issues that are important when considering a bound: 

Static vs. dynamic. If a search algorithm uses the same search tree for every 
access, it is said to be static, while if the comparisons performed to execute 
a given search depend upon the previous searches performed it is said to be 
dynamic. The static optimality bound is the best bound possible if the search 
algorithm generates the same comparison tree for every access. 

Online. A bound where the runtime bound to execute is a function of the 
sequence (oi . . . Ui) is said to be an online bound. All of the bounds listed above, 
except the static optimality, bound are online. The static optimality bound is 
not online because it is computed as a function of the frequency count over the 
entire length of a sequence. 

Amortization. For any operation there can be at most 0(2^^) different searches 
than can be done using at most k comparisons. Any bound that at any time 
that has a;(2'^) different searches perform only k comparisons for some value of 
k means that the bound can not hold in the worst case. None of the bounds 
above require amortization, and if a bound does require amortization, that is 
probably a sign that it is somehow unnatural. 

Binary Search Tree model. Wilber |12| formalized the binary search tree model; 
in this model the data structure is a binary search tree which can be restructured 
through the use of rotations. The set of sequences which binary search trees can 
execute quickly seems to be a reasonable classifier of those sequences that we 
consider to be natural. Without a restriction to the EST model, given any single 
access sequence, it is possible to create a data structure that will execute the 
searches in that sequence quickly, and others slowly. This is not possible in 
the EST model as there are deterministic sequences such as the bit reversal 
permutation that can not be executed faster than O(logn) amortized time. 

The class of EST data structures also have the possibility that there may exist 
an online EST data structure that can execute every access sequence asymptot- 
ically as fast as the best EST data structure for that sequence. Such a structure 

® In an unfortunate naming conflict, Sleator and Tarjan have a "Unified Theorem" for 
splay trees [2l Theorem 5] and the bound in the Unified Theorem is also sometimes 
called the "unified bound." 
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would be called dynamically optimal; no BSTs are known to be dynamically 
optimal although spay trees and Lucas' trees [13] are conjectured to be. Blum 
et. al. jl4[ gave a non-tree data structure that runs within a constant factor of 
comparisons of any EST data structure, but requires superpoloynomial time to 
decide which comparisons to perform. Tango trees |15j are a EST that execute 
every sequence within a O(loglogn) factor of the best possible binary search 
tree. Of the bounds described above, no EST can have the queueish property, it 
is conjectured that there is a EST with the unified property, and the rest of the 
bounds described above are achievable by EST data structures. 



2 Problems with the unified bound 

The unified bound is the best proposed bound for binary search trees, and seems 
to be a reasonable combination of temporal locality and locality in keyspace. 
However, we will show that the unified bound has a flawed view of keyspace, 
and propose a new bound that attempts to rectify this fiaw. 

Recall that the unified property roughly states than access is fast when the 
current access is close to a recent access. For example, consider the following 
access sequence (assume n is even and n divides m): 

A=(l,- + l,2,-+2,3,-+3,...,-,n) (1) 

(The exponentiation denotes that the sequence is repeated m/n times to make 
a sequence of length m.) Observe that, except for the first two accesses in each 
cycle, every access is at distance one from the element accessed two accesses ago. 
A dictionary with the unified property would therefore perform this sequences 
in time at most 

2{m/n) ■ O(logn) + (n - 2){m/n) ■ 0(1) € 0{m) 

for an amortized cost of 0(1) per access. 

Next, consider the following access sequence: 

A' =(K,-+K,2K,- + 2K,iK,-+iK,...-,n) (2) 

where n is a multiple of K, mK is a multiple of n, and n^/"* < K < ^fn. For this 
sequence, the unified bound is useless: any element accessed less than Kj^ time 
units in the past is at distance at least K from the currently accessed element, 
so the cost of every access is i7(log A') = f2(logn). 

On the other hand, the sequence A! is not very different from A. Indeed A! 
can be viewed as the sequence A over a larger set, 5, in which a (1 — 1 /if) fraction 
of the elements are never accessed. Intuitively, in a good data structure these 
irrelevant elements should "fall out of the way" in order to speed up accesses to 
the important elements (multiples of if). 
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The sequence A' demonstrates the problem with the unified property: The 
distance function d{x, y) simply measures the number of keys between x and y. 
But, suppose some key values have not been accessed in a long time relative to x 
and J/, or in the extreme case, have never been accessed. Why should the number 
of such keys between x and y influence the runtime of accessing keys such as x 
and y? Put simply, they should not. Data structures such as splay trees will have 
items that arc never accessed "percolate" to to bottom of the structure, and the 
runtime of a splay tree with a subtree of never-accessed keys is identical to the 
runtime if the keys are not there. Thus we need a more nuanccd c?(-, •) function 
that "forgets" keys that have not been accessed in a while when computing key 
distance. 

In this paper, we will expand on the idea of counting only recently accessed 
elements towards the distance between elements stored by the dictionary. The 
remainder of the paper is organized in the following way. In Section [2J we define 
a new, stronger version of the unified property. In Section 21 we show how to 
construct a dictionary that has this new property. We conclude with possible 
directions for future research in Section [Sj 



3 Defining the fresh-finger property 

In this section, we define a new, stronger version of the unified property that 
we term the fresh-finger property. Recall that A = (ai, 02, ... , a,m) is our access 
sequence. Define 

li{x) = min {{co} U { j > | ai_j x}) 

One can think of li{x) as the most recent time x has been queried in A before 
time i. We then define 



Wi{x) 



n if li{x) = 00 

|{ai_i^(2.)+i,...,ai}| otherwise 



which is the working-set number of x at time i. Wc also define Wi{i) to be the 
set of all elements a; £ S* at time i such that 'Wi{x) < j, i.e., the set of all elements 
with working-set number at most j at time i. Next, we define dxix, y) to be the 
rank distance between a and b in the set T, i.e., 



dT{x,y) 

Finally, define 



\{z G T : X < z < y}\ ii x < y 
\{z £ T : y < z < x}\ otherwise. 



yi{x,T) = argminti;i(2/) +dT{x,y) 
yeT 

We are now ready to define the fresh- finger property. In terms of the preceding 
notation, the unified property states that the time to access the element x S S* 
at time i is 

0(logiw,iy,ix, S)) + dsix, y,{x, S)))) (3) 
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A first attempt at defining the fresh-finger property might be 

0{\og{w,{y^{x,W^{w^{x)))) + dw^(^^^.(^^-j-j{x , y^{x , W^{w,{x)))))) . (4) 

Equation (j4]) should be contrasted witli the definition of the unified property 
defined by In the rank distance between x and other elements is mea- 
sured only with respect to the set Wi{wi{x)), the set of elements that have been 
accessed since the last access to x (i.e., a set of fresh fingers). In ([3]), the rank 
distance is measured with respect to S, the entire set of elements stored in the 
data structure. 

Since Wi{'Wi{x)) C 5*, (j4]) is certainly a stronger requirement. Unfortunately, 
it is too strong, and there is no comparison-based data structure that can achieve 
this bound in the worst-case. To see this, consider the access sequence 

(l,2,3,...,n,l,x) 

where a; G {1, . . . , n}. Let i — n + 2 (so that ai = x represents the second access 
to x). Then Wi{uui{x)) = {l,x,x + l,x + 2,...,n}. But then the rank difference, 
d'Wi{wi){x, 1), between x and 1 is at most 1 and ^^(l) = 1, so, according to (g]), 
the time to accessing x is at most 

0{\og{w,il) + dw,i^Mx,l)) ^ 0{l) 

But this is true for any x £ {!,... ,ri}, so for any of the n choices for x, ^ 
requires that a data structure execute the access in constant time. This is not 
information-theoretically possible; accessing a randomly chosen x € {1, . . . ,n} 
requires at least logj n comparisons in expectation. 

From the preceding discussion, we conclude that the set in which we measure 
rank distance should be expanded. This leads to the following definition: 

Definition 1. A data structure has the fresh- finger property if its runtime for 
an access is bounded by 

0{\og{wi{y^ix,W.,{w^{xf))) + dw^(^^^(^^y2^{x,y^{x,W,{w^{xy))))) . 

Observe that the set Wi{wi{x)'^) contains elements that have been accessed 
less recently than x. These additional elements will allow us to support the 
rest of the access cost while respecting information-theoretic lower bounds. The 
intuition for expanding the set under consider in this manner is the fact that the 
data structure will consist of substructures that increase doubly-exponentially 
in size, and so by squaring the working-set number under consideration, we take 
advantage of elements in an adjacent substructure. 

For brevity, we define 

yi{x) = yi{x,Wi{wi{x)'^)) 

and 

FF,(a;) = \og{wi{yi{x)) + dw.(^^.(^^y){x,yi{x))) 
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As an example, consider the following access sequence, where 15 is the ele- 
ment currently being accessed at the end of the sequence. 



lVi(iOi(15)^) 



2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 1, 15 



W^{Wi{15)) 



The original definition of the fresh-finger property uses VKj(u),;(15)), while 
the modified definition uses Wi(wi(15)^). This modified definition allows for 
9, 10, 11, 12, 13 to contribute to the rank distance. This results in a query time 
that does not violate information-theoretic lower bounds, since it does not result 
in a situation where all n queries must be executed in constant time. Before 
presenting a data structure that (nearly) achieves the fresh-finger property, it 
is worth doing a sanity-check of this definition. In particular, we confirm that 
there exists (distributions over) sequences A ~ (ai, . . . ,«,„) such that 



is a lower-bound for accessing ai, . . . , a„i. 

Theorem 1. For all positive integers n, r < n, and m > 2r logn, there exists a 
distribution, A, over {1, . . . ,n}"^ such that, for any comparison-based dictionary 
data structure, D, that stores {1, . . . , n}, and an access sequence A — ai, . . . , am 
drawn from A 

1. FF{ai,...,am) = 0{m\ogr). 

2. the expected number of comparisons performed by D while accessing A is 
f2{m\ogr) . 

Proof. The sequence A is defined as = i for i < r or is selected uniformly 
at random from the set {1, . . . , r} for r < i < m. This choice of A immediately 
implies that 



is a lower-bound on the expected number of comparisons performed by D while 
accessing the (randomly chosen values) a^+i, . . . , Om- This establishes Part 2 of 
the result. 

On the the other hand, to establish Part 1, we have 



m 




(m — r) log2 r > (m/2) log2 r = ]7(mlogr) 




for r < i < m 



for i < r 



Thus, 



FF(ai, . . . , Qm) < r \ogn + (m — r) log(2r) ~ 0(m log r) 



since m > 2rlogn. 
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4 Towards the fresh-finger property 



In this section, we describe a data structure that comes to within a small additive 
term of achieving the fresh-finger property. 

4.1 The data structure 

The data structure consists of k finger search trees Ti , T2 , . . . , as well as k 
accompanying queues Qi,Q2, ■ ■ ■ ,Qk- Recall that finger search trees can support 
insertions and deletions in 0(1) worst-case time (when provided with a pointer 
to the element to be deleted) and finger searches in O(logd) worst-case time, 
where d is the distance between the element being searched for and the supplied 
pointer into the data structure [TB] . 

The size of Tj is 2^^, except for Tk which has size n. It follows that k is 
O(loglogn). We will maintain the invariant that Tj C Tj+i for all 1 < j < n. 
The queue Qj contains exactly the same elements as Tj in the order they were 
inserted into Tj. Pointers arc maintained between elements in the queue and 
corresponding elements in the finger search tree. 

To perform a search, we will perform finger searches for x in Ti, T2, . . . until 
we find X for the first time (say, x G Tj). In Ti, we use an arbitrary element as 
the starting finger for the search. In all other trees, we run two finger searches for 
X in parallel: one from the successor of the element found in the previous finger 
search tree, and one from the predecessor of the element found in the previous 
finger search tree. As soon as the first of these two searches terminates, we stop 
the other. 

To restructure the data structure after wc have found x £ Tj, we must insert 
X into Ti, . . . , Tj^i (note that x is not present in any of these trees, since if 
it were, it would have already been found) and enqueue a; in Qi, Q21 ■ ■ ■ , Qj-i- 
At this point, we note that each of Ti, T2, . . . , Tj^i and Qi,Q2, ■ ■ ■ , Qj-i are too 
big. We therefore dequeue the oldest element in each of Qi,Q2, ■ ■ ■ ,Qj~i and 
delete the corresponding elements in Ti, T2, . . . , Tj_i. 

4.2 Analysis 

Recall that we arc aiming for a running time of 

0{Ffi{x)) ^ 0{log{wi{yi{x)) + dw,{ioi{xr-){x,yi{x)))) 

Consider a search for x at time i, and consider the element yi{x). Suppose 
X first appears in Tj and yi{x) first appears in Tji. Because x first appears in 
Tj, we have that Wi{x) > 1?' . Therefore, j < loglogu'i(a;) -I- 0(1). Similar 

reasoning shows Wi{yi{x)) > 2^^ \ so that / < \og\ogWi{yi{x)) + 0(1). We 
consider three cases, based on how j compares with j': 
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If j ^ j' (j-G-, X appears no later than yi(a;))0 then the runnmg time follows 
easily: x has working-set number Wi{x) < Wi{yi{x)). The element x can thus be 
found in time J^Li 2' = 0(2^), which is 0(^2^') = 0(log 

The more interesting case occurs when j > j' (i.e., x appears after yi(x)). In 
this case, the algorithm will reach the tree, Tji, containing yi{x) in time 

J2o{\og2'')^J2^'^o[v')=0{logw,{y.{x))) . 

e=i 1=1 

The search in Tji finds both the predecessor and and successor, j/i and y2 of x 
in Tj,. That is, 

yi<x < y2 

and yi{x) is not in the open interval {yi,y2)- In particular, for any set T one of 
yi or ?/2, say yi, has 

dT{x,yi) < dT{x,yi{x)) 

Indeed, from this point onwards, every search in Tg, for each £ G {j' + 1, . . . , j}, 
y'^ such that 

dT{x,yi) < dT{x,y^{x)) . 

The elements in Tj/+i, . . . , Tj_i are all in Wi{wi{x)), and so the remaining 
searches in T^v+i, . . . , Tj-i therefore take a total of at most 

U - f - ^)0{\ogdwi(wiix))ix,yi{x))) 
= 0{{\ogdwi(w,{x)){x,yiix))) (log log Wi{x))) 

time. 

At last, the final search, in Tj is the expensive one, since the only guarantee 
we have on the elements of Tj are that their working-set number is at most 
Wi{x)'^. Thus, the elements in Tj are a subset of the elements in Wi{wi{x)'^) and 
the time to search in Tj is at most 

dwi{w,{xy)ix,yi{x)) . 

In either case, the total search time thus far is at most 

0(log w^{y^{x))) 

+ O ( (log dwi (u,, (x)){x, (x) ) ) (log log [x) ) + log d^/. (^^(^)2-^{x,y^{x))) 

At this point, x has been found and wc must now adjust the data struc- 
ture. First, X must be inserted in Ti, T2, . . . , Tj_i. Because we have a finger 
for X inside each of these structures, this takes total time 0(loglogii;i(x)). En- 
queuing X in each of Qi, Q2, ■ • • , Qj-i also takes 0{j) = 0(loglog Wi(x)). The 
subsequent deletions and dequeueings of the oldest elements in Qi, (52, • ■ • , Qj~i 

® In fact, this case can only occur when j = j', since otherwise Wi{yi{x)) > Wi{x), and 
so Wi{yi{x)) + d{x, y) > Wi{x) + d{x, x), which contradicts tlie definition of yi{x). 



9 



and Ti, T2, . . . , Tj-i take a total of 0{j) = 0(loglog w,;(a;)) time as well, since 
the dequeueing operation takes 0(1) time and provides a pointer to the node 
in the corresponding tree where the deletion must be performed. Therefore, all 
restructuring operations take time 0(loglogw,;(a;)). 
We therefore have 

Theorem 2. There exists a static dictionary over the set {1, 2, . . . , n} that sup- 
ports querying element x in worst- case time 

0{FF^{x) + {\ogdw,{wi{x)){x,yi{x))){\og\ogWi{x))) 
5 Conclusion 

In this paper, we defined a stronger version of the unified property and described 
a data structure that achieves it to within a small additive term. Instead of 
computing rank distance over the entire dictionary, we compute rank distance 
only within a working-set containing an element that is close to a recently- 
accessed element. 

There are several possible directions for future research. 

1. One can measure distances within the set Wi{wi{x)^^'^) instead of the set 
Wi{wi{x)'^) by changing how the substructures grow. Is it possible to reduce 
this further? For example, is it possible to measure distances within the set 

W^iOiw^ix)))? 

2. We argued that it is not possible to measure within Wi{wi{x)) in the worst 
case. Is it possible to measure within this set in the amortized sense? 

3. Can the additive term in Theorem [2] be reduced? It seems difficult to re- 
duce this term below f2{loglog Wi{x)) using an approach similar to the one 
presented here, since elements must shift through at least this many sub- 
structures. 

4. We have only considered the case where 5* is static. Is it possible to maintain 
the fresh-finger property while supporting insertions into and deletions from 
S7 
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